An Age-Threshold Algorithm for Garbage Collection in Log-Structured Arrays and File Systems
نویسندگان
چکیده
In this paper, we propose and study a new algorithm for choosing segments for garbage collection in Log-Structured File Systems (LFS) and Log-Structured Arrays (LSA). We compare the performance of our new algorithm against previously known algorithms such as greedy and cost-beneet through simulation. The basic idea of our algorithm is that segments which have been recently lled by writes from the system should be forced to wait for a certain amount of time (the age-threshold) before they are allowed to become candidates for garbage collection. The expectation is that if the age-threshold is properly chosen, segments that have reached the age-threshold are unlikely to get signiicantly emptier due to future rewrites. Among segments that pass the age-threshold and become candidates for garbage collection, we select ones that will yield the most amount of free space. We show, through simulation, that our age-threshold algorithm is more eecient at garbage collection (produces more free space per garbage-collected segment) than greedy or cost-beneet; this means that designs using age-threshold will give better system performance than designs using greedy or cost-beneet. It is also simpler to implement a scalable version of the age-threshold algorithm than to implement a scalable version of the cost-beneet algorithm. The performance of the age-threshold algorithm depends on good choice of an age-threshold; therefore, we also give an analysis which can be used to choose an optimal age-threshold under certain workload assumptions. We also suggest how to choose good age-thresholds when nothing is known about the workload.
منابع مشابه
ParaFS: A Log-Structured File System to Exploit the Internal Parallelism of Flash Devices
File system designs are undergoing rapid evolution to exploit the potentials of flash memory. However, the internal parallelism, a key feature of flash devices, is hard to be leveraged in the file system level, due to the semantic gap caused by the flash translation layer (FTL). We observe that even flash-optimized file systems have serious garbage collection problems, which lead to significant...
متن کاملFile System Performance and Transaction Support
This thesis considers two related issues: the impact of disk layout on file system throughput and the integration of transaction support in file systems. Historic file system designs have optimized for reading, as read throughput was the I/O performance bottleneck. Since increasing main-memory cache sizes effectively reduce disk read traffic [BAKER91], disk write performance has become the I/O ...
متن کاملSystems Group , Department of Computer Science , ETH Zurich Efficient Scan in Log - Structured Memory Data Stores
Novel storage solutions like RAMCloud promise never seen before latency by storing all their data in main memory. To efficiently store and retrieve objects from memory some systems reuse ideas from log-structured file systems and utilise a log like data structure to hold the data objects in DRAM. Instead of updating data in place, with log-structured memory any modification to the data generate...
متن کاملA Novel Reordering Write Buffer to Improve Write Performance of Log-Structured File Systems
This paper presents a novel reordering write buffer which improves the performance of Log-structured File Systems (LFS). While LFS has a good write performance, high garbage-collection overhead degrades its performance under high disk space utilization. Previous research concentrated on how to improve the efficiency of the garbage collector after data is written to disk. We propose a new method...
متن کاملPROFS-Performance-Oriented Data Reorganization for Log-Structured File System on Multi-Zone Disks
I/O is a major performance bottleneck in modern computer systems. Modern disks use the Zone-Bit-Recording (ZBR) technology to increase the capacity. A direct consequence of ZBR is that outer tracks have higher data transfer rates. Because LFS uses large disk transfers, access times are mainly determined by the data transfer rate. This paper presents a novel performance-oriented data reorganizin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998